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The present application claims priority to U.S. provisional application SN 60/038,1 17, 
filed February 20, 1997. 



FIELD OF THE INVENTION 



5 



The present invention relates to distinguishing particular species of bacteria belonging to 



the genus Shigella and discriminating Shigella from E. coli. More specifically, an aspect of the 
invention provides species-specific and genus-specific identifying nucleotides for the design of 
nucleic acid probes for the detection and identification of Shigella species and for the 
discrimination between Shigella and E. coli, compositions thereof, and methods for their use. 



Shigellosis, also known as bacillary dysentery, is caused by several bacteria of the genus 
Shigella. Symptoms include diarrhea, abdominal pain, vomiting and fever. Generally, foodborne 
shigellosis involves a short incubation time, but symptoms can persist for up to 14 days. As few 
15 as 10 to 100 organisms have been shown to cause illness and secondary infections occur 

frequently. In the United States, Shigella accounts for up to 25% of the total cases of intestinal 
disease. The actual number of cases is estimated to be currently about 350,000. The 
proliferation of childcare facilities is playing an increasing important role in disease outbreaks. 

2 0 Shigella infection is a serious public health problem in the United States and, as such, 

physicians are required to report cases to the Centers for Disease Control and Prevention. 
Further, recent changes in government regulations will make it incumbent upon the food industry 
to report enteric bacterial contamination of food. Public Health Laboratories, operated by the 
States, often must test specimens provided by physicians in order to identify the specific species 

25 of Shigella causing infection. 

Shigella is an invasive pathogen that can be recovered from the bloody stool of an 
infected host. Invasive pathogens colonize the host ! s tissues as opposed to growing on tissue 
surfaces. Therefore, infections are also associated with mucosal ulceration, rectal bleeding, and 
30 drastic dehydration; fatality may be as high as 10-15% with some strains. Reiter's disease, 

reactive arthritis, and hemolytic uremic syndrome are possible sequelae that have been reported 
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in the aftermath of shigellosis. Infants, the elderly, and the infirm are susceptible to the severest 
symptoms of disease, but all humans are susceptible to some degree. Shigellosis is a very 
common malady suffered by individuals with AIDS and AIDS-related complex, as well as non- 
AIDS homosexual men. 

5 

Transmission is frequently by ingestion of contaminated food or water. The organisms 
survive the gastric milieu and make it to the large intestine where the bacteria attach to specific 
host cells via invasins and then invade the epithelial cells lining the large intestines. Additional 
cell death is produced by elaboration of shiga toxins. Shiga toxin is a cytotoxin (or enterotoxin) 
0 that destroys epithelial cells lining the lumen of the large intestine; this causes bloody diarrhea 
characteristic of Shigella infection. Plasmid-mediated invasins determine the virulence of the 
particular strain of Shigella. Symptoms develop after an incubation period of 24 to 48 hours and 
include fever, abdominal pain and diarrhea. 

5 The genus Shigella is a member of the family of Enterobacteriaceae and is thus related to 

Escherichia colt Their DNA relatedness is very high, they are often difficult to differentiate 
biochemically, and they cross-react serologically. However, they have remained separate species 
for clinical reasons. Enterohemorrhagic E. coli (EHEC) is a defined subset of toxin-producing 
Shigella and at least one serotype (0157:H7) of EHEC can cause hemorrhagic colitis and 

0 hemolytic uremic syndrome, a potentially fatal complication. 

The four species in the Shigella genus are sometimes referred to by a letter designation 
based on their serological antigen: 
Serotype A- S. dysenteriae 
5 Serotype B- S.flexneri 

Serotype C- S. boydii 
Serotype D- S. sonnei 

Serotype D is the causative agent of most cases of SV//ge//<2-related diarrhea. Although all 
Shigella species have been implicated in foodborne outbreaks at some time, S. sonnei is thought 
0 to be the leading cause of shigellosis from food. The other species are more closely associated 
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with contaminated water. One in particular, S.flexneri, is now thought to be in large part 
sexually-transmitted. 

Current laboratory methods for detecting Shigella depend on growing the organism from 
5 a stool or food sample on culture plates and distinguishing it from other organisms on the basis 
of growth characteristics and biochemical tests. However, conventional testing currently is 
difficult. Growth on MacConkey agar may yield colonies of Shigella and E. coli that are both 
colorless, lactose negative, and about 2-3 mm in diameter. Growth of both organisms on sheep 
blood agar yields colonies that are alike, having a smooth appearance and 2-3 mm in diameter. 

1 0 Tube testing is widely used in reference and public health laboratories. Unfortunately, the media 
and tests used are not well standardized, and few laboratories use exactly the same procedures. 
Some 47 biochemical reactions are numerated to help distinguish between named species, 
biogroups, and enteric groups of the family Enter obacteriaceae. These tests are labor-intensive 
and costly. A genetic probe to a virulence plasmid has been developed by FDA and is currently 

15 under field test. Blood serology can also be used to distinguish one species from another. 

Although the Centers for Disease Control require distinguishing among the four major species of 
Shigella in its reporting statistics, it is difficult to do so. 

Distinguishing between Shigella and E. coli is important for treating the infection. 

2 0 Diarrhea and other symptoms caused by infection with Shigella is one form of bacterial-caused 
intestinal inflammation in which antibiotic therapy has been clearly shown to reduce the duration 
and severity of symptoms and to shorten the period of fecal excretion of the organism. Thus, 
antibiotic therapy is generally recommended for all patients with Shigella diarrhea, except those 
with mild, self-limited disease. The recommended antibiotic is trimethoprim-sulfamethoxazole. 

2 5 Ampicillin-resistant strains are becoming a problem. There is no acquired immunity to infection 
and no vaccine exists. In contrast, there is no evidence that use of antimicrobial agents to treat E. 
coli 0157:H7 disease changes the course of the disease or is beneficial in any way. 
Distinguishing among the species of Shigella is not only important for proper reporting to the 
centers for Disease Control, but also for determining differing susceptibilities to antibiotics. 
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U.S. Patent 4,724,205 relates to a composition for therapeutic treatment or diagnosis of 
the toxin associated with Shigella dysenteriae. U.S. Patent 4,992,364 to Sansonetti et al relates 
to a probe for DNA and a process for detection of Shigella and enteroinvasive strains of E. coli. 
The probe reportedly contained a nucleic acid sequence originating from the 140 mDa virulence 
5 plasmid of the M 90 T strain of Shigella flexnerU which permitted the in vitro diagnosis of 
syndromes of dysentery or diarrhea of the shigellosis type. The probe, however, could not 
distinguish either between E. coli and Shigella or between the different species of Shigella. A 
similar strategy was apparently employed by Lampel and Jagow (U.S. Patent 5,041,372), but had 
the advantage of using synthetic oligodeoxyribonucleotides, which are more stable and easier 

1 0 and cheaper to make then the fragments prepared with restriction enzymes from the virulence 
plasmid. U.S. Patent 5,084,565 to Parodos et al. relates to nucleic acid probes capable of 
specifically hybridizing to rRNA of E. coli and Shigella species and not to rRNA of non- 
E. colil Shigella species. The hybridization assay could not distinguish either between Shigella 
and E. coli or between the different genera of Shigella. Faruque et ah (J. Clin. Microbiol. 

1 5 30(1 1):2996, 1992) reports differentiation of Shigella flexneri strains by rRNA gene restriction 

patterns. Nastasi et al. (Res. Microbiol. 141:1 163, 1990) reports the molecular analysis of strains 
of Shigella boydii isolated in northern and southern Italy. 

Present detection methods using nucleic acid identity are not able to distinguish Shigella 
2 0 species from E. coli or Shigella species from each other. Therefore, known procedures are not 
completely satisfactory, and the present inventors provide herein compositions and methods for 
distinguishing Shigella from E. coli and species of Shigella from each other. 

SUMMARY OF THE INVENTION 

2 5 The present invention seeks to overcome these and other drawbacks inherent in the prior 

art by providing species-specific and genus-specific and, therefore, species-identifying and 
genus-identifying, nucleotides of 16s rRNA (and therefore, also of 16s rDNA) from Shigella 
flexneri, Shigella sonnei, Shigella boydii, and Shigella dysenteriae. The species-specific 
nucleotides provided herein allow detection and identification of Shigella species present in a test 

3 0 sample. "Species-specific," as used herein, means that detection of an identifying nucleotide 

determines presence of that species, while lack of detection of a particular identifying nucleotide 
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does not necessarily mean absence of that species. Further, genus-specific nucleotides are 
provided that distinguish Shigella and E. coli. "Genus-specific," as used herein, means that 
detection of an identifying nucleotide determines presence of that genus, while lack of detection 
of a particular identifying nucleotide does not necessarily mean absence of that genus. 

5 

By "identifying or distinguishing nucleotide" is meant a nucleotide that differs in kind or 
in its presence or absence as compared to an E. co//-equivalent position. Nucleotide, as used 
herein, is meant in include the corresponding nucleoside without phosphate attached, or the 
corresponding free base. One of skill in the art realizes that the specificity relies in the base, not 

1 0 the sugar or phosphate and that the bases will usually be present in the form of a nucleotide for 
purposes of the present invention. Although the sequences presented herein including in the 
sequence listing and tables depict the DNA sequence having a thymine rather than a uracil as 
would be present in RNA, one of skill in the art would realize in light of the present disclosure 
that detection of a uracil base in RNA would be equivalent to detection of a T base in DNA and 

1 5 such equivalents are intended to be included in the scope of the claims. "Identifying or 

distinguishing nucleotide," as used herein, is meant to include a combination of one or more 
nucleotides, the combination being identifying of a species or of a genus. 

While identifying nucleotides and identifying combinations of nucleotides are 
2 0 specifically provided herein from sense strand information, one of skill in the art would realize, 
in light of the present disclosure, that a nucleotide on a complementary strand of nucleic acid 
complementary to an identifying nucleotide is also identifying. 

Accordingly, the present invention provides an identifying nucleotide or identifying 
2 5 combination of nucleotides of 16s ribosomal RNA or 16s ribosomal DNA as set forth in Table 2 
present in a Shigella species and not present in E. coli. The reference numbering system of Table 
2 is the rDNA sequence corresponding to the rRNA sequence of E. coli 16s rRNA strain 7 as 
provided in GenBank as ECORRD, and as provided as SEQ ID NO:7. The sequences of 16s 
rRNA or rDNA of Shigella species regions are aligned to maximally match the E. coli sequence. 
30 To maximize alignment, adjustments are made in the Shigella sequence, not the E. coli 
sequence. For example, as herein described, an insertion of a nucleotide into the Shigella 
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sequence as compared to the E. coli sequence is designated herein as a "p" nucleotide. A 
deletion is designated as an "X" nucleotide, "E. co/z-equivalent position," as used herein, means 
that such an alignment of Shigella sequences has been carried out and an E. coli equivalent 
number has been assigned to a Shigella nucleotide position. 

A further aspect of the present invention is an identifying nucleotide or identifying 
combination of nucleotides of 16s ribosomal RNA or 16s ribosomal DNA as set forth in Table 2 
present in E. coli and not present in a Shigella species. 

Another aspect of the invention is an identifying nucleotide or identifying combination of 
nucleotides of 16s ribosomal RNA or 16s ribosomal DNA as set forth in Table 2 present in one 
species selected from the group consisting of Shigella sonnei, Shigella flexneri, Shigella boydii, 
and Shigella dysenteriae, and not present in the other species of Shigella and not present in E. 
coli. 

Purified nucleic acid molecules used as probe molecules and capable of hybridizing to 
nucleic acid sequences having an identifying nucleotide are part of the present invention. 
Accordingly, a purified nucleic acid molecule capable of hybridizing to a 16s rRNA region or a 
16s rDNA region having a genus-specific nucleotide of Shigella and not to an equivalent 16s 
rRNA region or 16s rDNA region of E. coli, or a nucleic acid molecule complementary to said 
molecule, the molecule thereby capable of distinguishing Shigella from E.coli is a further aspect 
of the present invention. One of skill in the art would realize that a hybridizing molecule 
preferably would be designed to have sequence complementarity to a region of interest. 
Detection of an identifying nucleotide of a complementary strand sequence is intended to fall 
within the spirit and scope of the present invention. An assay kit for distinguishing Shigella 
from E. coli comprising a purified nucleic acid molecule as provided herein packaged in at least 
one container is a further aspect of the invention. 

A further aspect of the invention is a purified nucleic acid molecule capable of 
hybridizing to a 16s rRNA region or a 16s rDNA region having a genus-specific nucleotide of E. 
coli and not to an equivalent 16s rRNA region or 16s rDNA region of a Shigella species, or a 
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nucleic acid molecule complementary to said molecule, the molecule thereby capable of 
distinguishing E. coli from Shigella. An assay kit for distinguishing E. coli from Shigella 
comprising a purified nucleic acid molecule as provided herein packaged in at least one container 
is a further aspect of the present invention. 

5 

A set of probes may be used for determination of a genus or species. A first probe may 
eliminate certain species, while further hybridization with a second probe may then provide a 
definitive determination of genus or species. Accordingly, a first and a second purified nucleic 
acid molecule combination, the first capable of hybridizing to a first 16s rRNA region or 16s 
1 0 rDNA region as set forth in Table 2, the second capable of hybridizing to a second 16s rRNA 

region or 16s rDNA region as set forth in Table 2, the combination of molecules thereby capable 
of distinguishing E. coli, or a Shigella species, or a nucleic acid molecule combination 
complementary to said first and second molecules is an aspect of the invention. 

15 In particular, a combination of molecules for hybridizing to rRNA or rDNA of Shigella 

flexneri and not to rRNA or rDNA of E. coli, Shigella sonnei, Shigella boydii, or Shigella 
dysenteriae, or a set of nucleic acid molecules complementary to said combination is a further 
aspect of the present invention. A particular combination of nucleic acid molecules is wherein 
the first molecule hybridizes to a Shigella flexneri region containing a G nucleotide at position 

2 0 79, and the second molecule hybridizes to Shigella flexneri region containing a G nucleotide at 

position 89 or to Shigella flexneri region containing a C nucleotide at position 92p. An assay kit 
for identifying S. flexneri as distinct from other Shigella species comprising a combination of 
nucleic acid molecules as herein described in at least one container is an aspect of the invention. 

25 A purified nucleic acid molecule capable of hybridizing to 16s rRNA region or 16s rDNA 

region of Shigella sonnei and not to 16s rRNA region or 16s rDNA region of E. coli, Shigella 
flexneri, Shigella boydii, or Shigella dysenteriae, or a nucleic acid molecule complementary to 
said molecule is a further aspect of the present invention. In particular, the molecule may have a 
nucleotide sequence of SEQ ID NO: 20; or may have a nucleotide sequence that detects a C at 

3 0 position 964, or the absence of a base at E. ^//-equivalent position 978. An assay kit for 
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identifying S. sonnei as distinct from other Shigella species comprising a nucleic acid molecule 
as described herein packaged in at least one container is a further embodiment of the invention. 

Another embodiment of the present invention is a nucleic acid molecule capable of 
5 hybridizing to 16s rRNA region or 16s rDNA region of Shigella boydii and not to 16s rRNA 
region or 16s rDNA region of E. coli, Shigella flexneri, Shigella sonnei, or Shigella dysenteriae, 
or a nucleic acid molecule complementary to said molecule. In particular, the molecule may 
have a nucleotide sequence that detects a C at position 92. An assay kit for identifying S. boydii 
as distinct from other Shigella species comprising a nucleic acid molecule as described herein 
10 packaged in at least one container is another aspect of the invention. 

A purified nucleic acid molecule capable of hybridizing to 16s rRNA region or 16s rDNA 
region of Shigella dysenteriae and not to 16s rRNA region or 16s rDNA region of E. coli, 
Shigella flexneri, Shigella sonnei, or Shigella boydii , or a nucleic acid molecule complementary 
; , 15 to said molecule is another embodiment of the present invention. In particular, the molecule may 
»• have a nucleotide sequence that detects an A at position 76. An assay kit for identifying S. 

-q dysenteriae as distinct from other Shigella species comprising a nucleic acid molecule as herein 
^ described packaged in at least one container is an aspect of the invention. 

p2 0 Purified nucleic acid molecules having a nucleotide sequence of the 1 6s rRNA or rDNA 

f; of the Shigella species provided herein are also an aspect of the present invention. Accordingly, 

a purified nucleic acid molecule having a nucleotide sequence as set forth in SEQ ID NO: 3, SEQ 
ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 
SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID 

2 5 NO: 19, SEQ ID NO: 20, or SEQ ID NO: 2 lor a nucleic acid complementary to said purified 

molecule is a further aspect of the present invention. 

A method for testing an unknown sample suspected of having E. coli or Shigella species 
presence comprising demonstrating an identifying nucleotide or identifying combination of 

3 0 nucleotides of 16s rRNA or 16s rDNA as set forth in Table 2 within the sample, wherein the 

demonstration of an identifying nucleotide or identifying combination of nucleotides establishes 

9 
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presence or absence of E. coli or Shigella in the sample is a further embodiment of the present 
invention. The unknown sample may be a clinical sample for diagnosis, a food sample, or an 
environmental sample, for example. The method could be any of a number of methods of 
looking at particular nucleic acid sequences, for example, direct sequencing, dot-blot 
5 hybridization, solution hybridization, Southern or Northern blotting, PCR amplification of a 
target region followed by any of above methods and as described further herein. Preferably, the 
demonstrating is by a method selected from the group consisting of direct sequencing, dot blot 
hybridization, solution hybridization, Northern blotting, and Southern blotting of the unknown 
sample. Further methods may be known to one of skill in the art in light of the present 
1 0 disclosure. 

In particular, the unknown sample may be suspected of containing E. coli and the 
identifying nucleotide is a T at position 88p; the unknown sample may be suspected of 
containing Shigella sonnei and the identifying nucleotide is a C at position 964, or a deletion at 
1 5 position 978; the unknown sample may be suspected of containing Shigella dysenteriae and the 
identifying nucleotide is an A at position 76; the unknown sample may be suspected of 
containing Shigella boydii and the identifying nucleotide is a C at position 92; or the unknown 
sample is suspected of containing Shigella flexneri and the identifying nucleotide is a G 
nucleotide at position 79 in combination with a G at position 89 or a C at position 92p. 

20 

A particular advantage of the present invention is the heretofore nonexistent ability to 
distinguish and identify Shigella versus E. coli using nucleic acid probes and distinguish and 
identify the four Shigella species cited herein using molecular identification of species-specific 
nucleotides or genus-specific nucleotides. Identification of the species- and genus-specific 
2 5 nucleotides, and thereby, identification of the organism, can be now facilitated with increased 
sensitivity, i.e., the ability to detect these bacteria in a given sample more frequently than 
currently available methods, with reductions in assay cost due to less expensive reagents and 
reduced labor, with accurate identification, and with faster results because the test does not 
require the isolation of bacteria from a cultured sample prior to testing. 

30 
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Following long-standing patent law convention, the terms "a" and "an 11 mean "one or 
more" when used in this application, including the claims. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

5 The present invention relates to detecting and distinguishing among bacteria that belong 

to the species Shigella, including Shigella boydii, Shigella dysenteriae, Shigella flexneri, and 
Shigella sonnei; and detecting and distinguishing Shigella bacteria from Escherichia colL In 
particular, the invention includes provision of species-identifying and genus-identifying 
nucleotides of 16s rRNA or 16s rDNA from the above named species. Nucleic acid probes 
1 0 capable of differentiating among these species and genera by hybridization, along with methods 
for their use for specific detection in clinical samples, food samples, environmental samples, and 
the like are also provided. 

As used herein, the term "nucleic acid molecule" may refer to a DNA or RNA molecule 
1 5 which has been isolated free of total genomic nucleic acid. Therefore, a "purified" nucleic acid 
molecule, as used herein, refers to a nucleic acid molecule that contains ribosomal RNA or a 
ribosomal DNA gene or fragment thereof, yet is isolated away from, or purified free from, total 
cellular RNA or total cellular DNA. "Purified" nucleic acid molecules also refer to those made 
synthetically. Included within the term "nucleic acid molecule" are nucleic acid segments and 
2 0 smaller fragments of such segments. 

The term "rRNA or rDNA" is used for simplicity to refer to 16s ribosomal ribonucleic 
acid or 16s ribosomal deoxyribonucleic acid. Ribosomal RNA is usually single-stranded and 
will hybridize to complementary RNA or DNA fragments, or if self-complementarity exists, will 
2 5 hybridize to itself to form double-stranded regions. An oligonucleotide probe may also hybridize 
to a complementary region of RNA. Ribosomal DNA, as used herein, refers to the gene that 
encodes rRNA; rDNA is usually double-stranded, can be melted to a single-stranded form, and 
will hybridize to complementary RNA or DNA fragments. An oligonucleotide probe will also 
hybridize to a complementary region of double-stranded DNA to form a triple helix region. 

30 
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Table 1 lists the identity of sequences of the present disclosure having sequence 



identifiers. 

Table 1 

Identification of Sequences Having Sequence Identifiers 



SEQ ID NO: 


IDENTITY 


1 


nucleotide sequence of a forward primer for amplifying 16s rDNA gene by PCR 




nuvieoiide secjuent'e ui a reverse primer ior ampuiying ios rjL/i\/\ gene oy i v^iv 




iiuvicuuue oei^uenue ui 100 i oi Dnigeiiu j i&jirieri proviucu uy me prc&cni 
disclosure 


A 


nuvieoiide sequence ui ios fuin/a. oi £>nigeiiu sonnei provided oy ine present 
disclosure 


*> 


nucleotide sequence ox ios rivrN/\ oi onigeuci ciytsenieriue provided oy ine present 
disclosure 


O 


nucleotide sequence oi ios ruiN/v oi omgeua ooyuii provided oy tne present 
disclosure 


7 


nucleotide sequence oi i os rjL/iN/\ oi xi. con strain / naving vjcnoanK loenuiier 
ECORRD from Genbank 


8 


nucleotide sequence of region 71-100 of E. coli 16s rDNA strain 7 


Q 


nucleotide sequence oi region / l-iuu oi o. jiexneri ios ruiN/Y provided oy tne 
present disclosure 


10 


nucleotide sequence of region 71-100 of S. sonnei 16s rDNA provided by the present 
disclosure 


11 


nucleotide sequence of region 71-100 of S. dysenteriae 16s rDNA provided by the 
present disclosure 


12 


nucleotide sequence of region 71-100 of S. boydii 16s rDNA provided by the present 
disclosure 


13 


nucleotide sequence of region 961-991 of E. coli 16s rDNA strain 7 


14 


nucleotide sequence of region 961-991 of S, sonnei 16s rDNA provided by the 
present disclosure 


15 


nucleotide sequence of region 1001-1040 of E. coli 16s rDNA strain 7 


16 


nucleotide sequence of region 1001-1040 of S, sonnei 16s rDNA provided by the 
present disclosure 


17 


nucleotide sequence of region 1001-1040 of S. dysenteriae 16s rDNA provided by 
the present disclosure 


18 


nucleotide sequence of 80-100 of S. boydii provided by the present disclosure 


19 


nucleotide sequence of 76-94 of & dysenteriae provided by the present disclosure 
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nucleotide sequence of 961-980 of S. sonnei provided by the present disclosure 


21 


a composite nucleotide sequence of 79-98 for E. coli 


22 


a portion of SEQ ID NO:20 prior to X as shown in Table 3 



A purified nucleic acid molecule of a Shigella species of the present invention may have 
a nucleotide sequence set forth as SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 ? or SEQ ID 
NO:6, or fragments of said sequences. Unique species-specific and genus-specific DNA probes 
are provided as SEQ ID NOS: 9-12, 14, 16-21, or fragments thereof. The probes are 
complementary to areas of rRNA or rDNA having species-specific nucleotides or genus-specific 
nucleotides. Probe molecules of the present invention are provided for detecting and 
discriminating in a test sample the presence of rRNA or rDNA molecules of Shigella species and 
E. coli. 

In certain aspects, the invention concerns isolated DNA segments that include within 
their sequence a nucleic acid sequence essentially as set forth in SEQ ID NO:3-21. The term 
11 essentially as set forth" means that the nucleic acid sequence substantially corresponds to a 
portion of SEQ ID NO:3-21, and has few nucleotides that are not identical, or functionally 
equivalent, to the nucleotides of said sequences. The term "functionally equivalent nucleotide 11 is 
used herein to refer to nucleotides capable of hydrogen bonding to a complementary nucleotide 
in a similar manner and degree to that of a natural nucleotide. A thymine base may be replaced 
by a uracil base, for example. A "functionally equivalent nucleotide" includes, but is not limited 
to, derivatized nucleotides such as methylated nucleotides; nucleotide analogs such as xanthine, 
hypoxanthine, inosine, and the like. One of skill in the art, in light of the present disclosure, 
would be able to alter one or more nucleotides of a probe molecule as described herein and 
maintain ability to detect identifying nucleotides. 

Nucleic acid molecules may include additional residues, such as additional 5' or 3 1 
nucleotides, and yet still be essentially as set forth in one of the sequences disclosed herein, so 
long as the sequence meets the criteria set forth herein, including the maintenance of hybridizing 
activity. End-capped oligonucleotides may be incorporated into the nucleic acid molecules of the 
present invention. End-capped probes contain chemical moieties that block the 3' and/or 5 1 
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terminal hydroxyl groups of the oligonucleotide chains providing stability since capped probes 
are no longer susceptible to degradation by exonucleases known to degrade oligonucleotides. 
Indicator molecules, such as fluorophores, radioactive labels, peptides, or the like may be 
attached to nucleic acid molecules of the present invention at the 3' end, the 5 f end, or at other 
5 locations on the molecule, and still fall within the scope of the claimed invention. 



The present invention includes a purified nucleic acid molecule complementary, or 
essentially complementary, to the nucleic acid molecule having the sequence set forth in SEQ ID 
NO:3-21 or a fragment thereof. Nucleic acid sequences which are "complementary" are those 
1 0 that are capable of base-pairing according to the standard Watson-Crick complementarity rules. 
As used herein, the term "complementary sequences" means nucleic acid sequences which are 
substantially complementary or as defined as being capable of hybridizing to the nucleic acid 
segment of said sequences under relatively stringent conditions. Complementary nucleotide 
sequences are useful for detection and purification of hybridizing nucleic acid molecules. 

15 

For general reviews of synthesis of DNA, RNA, and their analogues, see 
Oligonucleotides and Analogues, F. Eckstein, Ed., 1991, IRL Press, New York; Oligonucleotide 
Synthesis, MJ. Gait, Ed., 1984, IRL Press Oxford, England; Caracciolo et al (1989); 
Bioconjugate Chemistry, Goodchild, J. (1990); or for phosphonate synthesis, Matteucci, MD. 
20 et al, Nucleic Acids Res, 14:5399 (1986) (the references are incorporated by reference herein). 



In general, there are three commonly used solid phase-based approaches to the synthesis 
of oligonucleotides containing conventional 5 ! -3 ! linkages. These are the phosphoramidite 
method, the phosphonate method, and the triester method. 

25 

A brief description of a current method used commercially to synthesize oligomeric DNA 
is as follows: Oligomers up to ca. 100 residues in length are prepared on a commercial 
synthesizer, eg., Applied Biosystems Inc. (ABI) model 392, that uses phosphoramidite 
chemistry. DNA is synthesized from the 3' to the 5' direction through the sequential addition of 
3 0 highly reactive phosphorous(III) reagents called phosphoramidites. The initial 3' residue is 
covalently attached to a controlled porosity silica solid support, which greatly facilitates 
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manipulation of the polymer. After each residue is coupled to the growing polymer chain, the 
phosphorus(III) is oxidized to the more stable phosphorus(V) state by a short treatment with 
iodine solution. Unreacted residues are capped with acetic anhydride, the 5*- protective group is 
removed with weak acid, and the cycle may be repeated to add a further residue until the desired 
5 DNA polymer is synthesized. The full length polymer is released from the solid support, with 
concomitant removal of remaining protective groups, by exposure to base. A common protocol 
uses saturated ethanolic ammonia. 

The phosphonate based synthesis is conducted by the reaction of a suitably protected 
1 0 nucleotide containing a phosphonate moiety at a position to be coupled with a solid phase- 

derivatized nucleotide chain having a free hydroxyl group, in the presence of a suitable activator 
to obtain a phosphonate ester linkage, which is stable to acid. Thus, the oxidation to the 
phosphate or thiophosphate can be conducted at any point during synthesis of the oligonucleotide 
or after synthesis of the oligonucleotide is complete. The phosphonates can also be converted to 
1 5 phosphoramidate derivatives by reaction with a primary or secondary amine in the presence of 
carbon tetrachloride. 

In the triester synthesis, a protected phosphodiester nucleotide is condensed with the free 
hydroxyl of a growing nucleotide chain derivatized to a solid support in the presence of coupling 
2 0 agent. The reaction yields a protected phosphate linkage which may be treated with an oximate 
solution to form unprotected oligonucleotide. 

To indicate the three approaches generically, the incoming nucleotide is regarded as 
having an "activated" phosphite/phosphate group. In addition to employing commonly used 
2 5 solid phase synthesis techniques, oligonucleotides may also be synthesized using solution phase 
methods such as diester synthesis. The methods are workable, but in general, less efficient for 
oligonucleotides of any substantial length. 

Preferred oligonucleotides resistant to in vivo hydrolysis may contain a phosphorothioate 
30 substitution at each base {J. Org. Chem., 55:4693-4699, (1990) and Agrawal, (1990)). 
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Oligodeoxynucleotides or their phosphorothioate analogues may be synthesized using an 
Applied Biosystem 380B DNA synthesizer (Applied Biosystems, Inc., Foster City, CA). 

Ribonucleotides may be synthesized with protecting groups on the free ribose oxygen, 
5 i.e., a O f -alkyl group, for example. The alkyl is preferably a methyl, although ethyl or propyl 
may be used also. 

Oligonucleotides may also be synthesized by application of nucleic acid reproduction 
technology, such as the PCR technology of U.S. Patent 4,683,202 (herein incorporated by 
10 reference). 

Hybridization is a process where, under predetermined reaction conditions, partially or 
completely complementary nucleic acids are allowed to come together in an antiparallel fashion 
to form a double-stranded or triple-stranded nucleic acid with specific and stable hydrogen 
1 5 bonds. 

Oligonucleotide probes are useful in hybridization embodiments such as Southern or 
Northern blotting, or dot blot hybridization, for detection of the presence in a test sample of 
species-specific or genus-specific nucleotides of the present invention. The total size of 
2 0 fragment, as well as the size of the complementary regions, will depend on the intended use or 
application of the particular nucleic acid segment. Smaller fragments will generally find use in 
hybridization embodiments where the length of the complementary region may be varied, such as 
between about 10 and about 40 nucleotides. The use of a hybridization probe of about 10 
nucleotides in length allows the formation of a duplex molecule that is both stable and selective. 

2 5 Molecules having complementary sequences over stretches greater than 10 bases in length are 

generally preferred to increase stability and selectivity of the hybrid, and thereby improve the 
quality and degree of specific hybrid molecules obtained. Nucleic acid probes having 
complementary regions of 15 to 25 nucleotides are most preferable. 

3 0 Standard hybridization conditions is used to describe those conditions under which 

substantially complementary nucleic acid segments will form standard Watson-Crick base- 
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pairing. A number of factor determine the specificity of binding or hybridization, such as pH, 
salt concentration, the presence of chaotropic agents (e.g. formamide and dimethyl sulfoxide), 
the length of the segments that are hybridizing, the base composition of the probe/target duplex, 
as well as by the level and geometry of mispairing between the nucleic acids. Stringency is also 
5 governed by reaction parameters such as the concentration and type of ionic species present in 
the hybridization solution, the types and concentrations of denaturing agents present, and/or the 
temperature of hybridization. Generally, as hybridization conditions become more stringent, 
longer probes are preferred if stable hybrids are to be formed. 

10 As used herein, "probe" refers to synthetic or biologically produced nucleic acids that by 

design or selection contain specific deoxyribonucleotide or ribonucleotide sequences that allow 
them to hybridize under defined predetermined stringencies, specifically and preferentially to 
target nucleic acid sequences. A target nucleic acid sequence is one to which a particular probe 
is capable of preferentially hybridizing. As used herein, a target nucleic acid sequence contains a 

15 species-specific or genus-specific nucleotide. 

Varying conditions of hybridization may be used to achieve varying degrees of selectivity 
of probe towards a target sequence. Stringent hybridization conditions may include 
hybridization to target nucleic acid sequences at 40° -65° C for 14-16 h in a hybridization 
2 0 solution containing 0.9 M NaCl, 0. 12 M Tris-HCl, pH 7.8, 6mM EDTA, 0. 1 M sodium 

phosphate buffer, 0.1% SDS, 0.1% polyvinylpyrrolidone, followed by three, 15-minute washes at 
40°-65 ° C to remove unbound probes in a solution containing 0.075 M NaCl, 0.0075 M Na 
Citrate, pH 7 and 0.1% SDS. Such selective conditions tolerate little, if any, mismatch between 
the probe and a test sample, and would be particularly suitable for detecting species-specific or 

2 5 genus-specific nucleotides. 

The control of stringency can take place in the hybridization procedure. However, it is 
sometimes conveniently controlled in the post hybridization stringency washes. Stringency 
washes are usually performed at 3-5 °C below the Tm of the perfectly matched probe where 

3 0 discrimination from mismatched sequences is required. The Tm of an oligonucleotide can be 

calculated according to the rule: 

17 
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Tm=(4x number of G+C bases) + (2x number of A+T bases), 
though it must be appreciated that this value is only an approximation (particularly for probes 
greater than 25 bases in length). Stringency can be controlled by alterations in the temperature or 
by changing the SSC concentration in the stringency wash. 

5 

Hybridization probes described herein are useful as reagents in solution hybridization as 
well as in embodiments employing a solid phase. In embodiments involving a solid phase, the 
test DNA (or RNA) is adsorbed or otherwise affixed to a selected matrix or surface. This fixed 
nucleic acid is then subjected to specific hybridization with selected probes under desired 

1 0 conditions. The selected conditions will depend on the particular circumstances based on the 
particular criteria required (depending, for example, on the G+C contents, type of target nucleic 
acid, source of nucleic acid, size of hybridization probe, etc.). Following washing of the 
hybridized surface so as to remove nonspecifically bound probe molecules, specific hybridization 
is detected, or even quantified, by means of the label. A wide variety of appropriate indicator 

15 means are known in the art, including fluorescent, radioactive, enzymatic or other ligands, such 
as avidin/biotin, which are capable of giving a detectable signal. In preferred embodiments, a 
fluorescent label or an enzyme tag, such as urease, alkaline phosphatase or peroxidase, may be 
used. In the case of enzyme tags, colorimetric indicator substrates are known which can be 
employed to provide a means visible to the human eye or spectrophotometrically, to identify 

2 0 specific hybridization with complementary nucleic acid-containing samples. Further, 

chemiluminescence, bioluminescence or radioactive indicators are recorded on sensitive film and 
the data converted to digitized information. 

Further methods for detecting and identifying species-specific and genus-specific 
25 nucleotides of the present invention include direct sequencing of the region including said 

nucleotide, PCR amplification of said region followed by direct sequencing or hybridization or, 
if said nucleotide forms or destroys a restriction site, then restriction fragment length 
polymorphisms may be relied upon. 

30 In another aspect, the present invention contemplates a diagnostic kit for screening a test 

sample for the presence of Shigella sp. or E. coli. Such a kit would contain a nucleic acid probe 
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having specificity for a species-specific or genus-specific nucleotide provided by the present 
disclosure. Nucleic acid molecules, as probes, can be used in ELISA type assays. Nucleotides 
can be synthesized that carry peptides and these peptides could be identified with monoclonal or 
polyclonal antibodies. If the antibodies were linked to a signaling system such as an enzyme 
5 (horseradish peroxidase, for example), bioluminescence, chemiluminescence, radioactivity, 

fluorescence, or other known signal-generating systems, then the ELISA test could be developed. 
The kit may contain probes to be used as controls for the hybridization tests. 

A test kit may include nucleic acid molecules having a nucleotide sequence of SEQ ID 
10 NOS: 9-12, 14, 16-21, or fragments thereof together with a negative control and a positive 
control. The negative control would help rule out false positive results whereas the positive 
control would help rule out false negative results. A specimen would be placed on a supporting 
surface such as, but not limited to, a membrane, ELISA well, gel, or other solid surface or placed 
in solution. Cells would be lysed with a lysing buffer and the DNA or RNA either immobilized 
15 on the support surface or solubilized in solution. Each specimen would be reacted in separate 
tests with one or more of the probes either simultaneously or sequentially in hybridization or 
other type tests. Detection of hybridization would be made through fluorescence analysis, 
bioluminescence, chemiluminescence, radioactivity counting, or other analyses. When employed 
in an ELISA-type assay, the development of color in solution or as a spot or through other type 
20 of standard analyses would be indicative of a positive reaction. 

Positive reactivity with a probe specific for E, coli would identify the specimen as such. 
Such specimens would be unreactive with the Shigella probes. Similarly, one or more probes 
that elicit a positive response and are identifiers for Shigella would indicate the presence of 
2 5 Shigella in the specimen. Such specimens, if they contain only Shigella, should be unreactive 

with a probe specific for E. coli. One or more of the probes may react with one or more Shigella 
species, providing discrimination not only between E. coli and Shigella, but between one strain 
of Shigella and another. Thus, the probes may be capable of enhanced discrimination and may 
identify not only Shigella but a particular strain, such as S. sonnei. 

30 
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The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
5 practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

Example 1 

1 0 Preparation of Stock Cultures 

The present example provides for the preparation of stock cultures for the studies 
provided herein. Strains of Shigella sonnei (ATCC 29930), Shigella dysenteriae (ATCC 13313), 
Shigella boydii (ATCC 8700), and Shigella flexneri (ATCC 29903) were obtained from the 
American Type Culture Collection (Rockville, MD). Escherichia coli strains 0157:NM (CDC 

1 5 G5066) and 01 57:H7 (CDC G5039) were provided by the Center for Disease Control (Atlanta, 
GA). For each of the four Shigella strains, lyophilized organisms were suspended in LB Broth 
and incubated in a shaker for several hours. A loopful of culture was removed and plated on LB 
agar overnight. Fresh isolated colonies were then incubated into LB broth and again grown 
overnight to prepare stock culture. Stocks were stored three different ways: on LB agar slants at 

2 0 room temperature, in LB broth containing 30% glycerol at -70°C, and in LB broth containing 
30% glycerol in liquid nitrogen. 

Example 2 
Extraction of Genomic DNA 

2 5 The present example provides methods used for extraction of genomic DNA from each of 

the four Shigella species. For extraction of genomic DNA, single colonies from the LB agar 
plate were grown in LB broth for several hours in a shaker incubator. This broth culture was 
then inoculated into a larger amount of LB broth in a 1 -liter flask and shaken overnight at 35 °C. 
Pellets of Shigella cells were collected by centrifugation and resuspended in buffer. Suspended 

3 0 pellets were transferred to sterile glass tubes, and sodium dodecyl sulfate and proteinase K were 

added to break open the cells. After adding salt and mixing thoroughly, prewarmed 
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hexadecyltrimethylammonium bromide containing salt was mixed in. The mixture was briefly 
incubated in a water bath followed by the addition of chloroform:isoamyl alcohol for extraction. 

The top, aqueous layer containing the nucleic acids was transferred to a fresh tube and the 
5 extraction steps repeated. To the final top, aqueous layer was added pure isopropanol, which 
resulted in the appearance of a stringy, white DNA pellet containing RNA that came out of 
solution and condensed into a tight mass. The nucleic acid was spooled up onto a glass rod, 
washed with 70% ethyl alcohol to remove residual chemicals, and dried under a vacuum/heater. 

10 The dried nucleic acids were resuspended in buffer, RNase A was added, and the mixture 

incubated in a water bath to eliminate RNA contaminants. DNA in solution was then re- 
precipitated by adding very cold ethyl alcohol. Precipitated DNA was spooled using a glass rod 
and washed again with ethyl alcohol before being dried under a vacuum/heater. The dried DNA 
pellets were resuspended in buffer, the DNA concentration was determined, and each solution 

1 5 was diluted to a final concentration of 1 00 ng/|il. 

Example 3 
Amplification of the rDNA Gene by PCR 

Amplifying the 16S ribosomal RNA gene involved the use of two primers having 

2 0 nucleotide sequences as shown by SEQ ID NOS: 1 and 2, buffer, magnesium chloride, 

deoxynucleotides, and the enzyme Tag polymerase. The primer sequences were based on 

literature reports for amplifying the 16S ribosomal RNA gene from E. coll A separate reaction 

tube was used for each of the four Shigella species. The samples were then placed in a 

thermocycler, which used the following procedure in which the programs were linked, with 1 to 

25 2 to 3 to 4: 

95 °C for 6 minutes to denature double-stranded DNA One cycle 

92 °C for 2 minutes to denature the DNA strands Thirty cycles with next two steps 

42 °C for 45 seconds to anneal primers 
72 °C for 4 minutes to extend the DNA chain 
30 72 °C for 20 minutes to complete the extension One cycle 

4°C to hold the reaction mixture for further analysis 
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PCR products were run in agarose gels and fragment size was analyzed with a DNA 
ladder consisting of synthetic oligonucleotides of different sizes ranging from 100 base pairs to 
22,000. For comparisons, E. coli strains that produce symptoms similar to Shigella were also 
tested in the PCR reaction after DNA had been isolated from them. Other species tested in the 
5 PCR reactions included Bacillus subtilis as well as different Vibrio species. 

Example 4 
Cloning the Amplification Product 

The TA™ Cloning Kit from Invitrogen (San Diego, California) was used together with 

1 0 the following strategy. The kit is designed for cloning PCR products directly from a PCR 

reaction without the need for modifying enzymes, purification, or digestions with restriction 

enzymes. The TA cloning vector, pCR™ II, contains the lacZa complementation fragment for 

blue-white color screening, ampicillin and kanamycin resistance genes for selection, and a 

versatile polylinker that contains EcoRI and Sp 6 and T7 priming sites. The amplified genomic 

1 5 material was ligated into the vector, using a ligation reaction for each Shigella species that 

included buffer, vector, PCR product, and T4 DNA ligase. The mixtures were incubated in a 

12 °C water bath overnight. The expected result was to obtain a vector containing the genomic 

insert. 

2 0 E. coli competent cells were then used to transform the vector containing the ligated 

DNA. The cells had a transformation efficiency of at least 1 x 10 8 transformants per microgram 
of supercoiled plasmid. For transforming the cells, LB agar plates were prepared with X-gal and 
ampicillin. The X-gal is a histochemical substrate for p-galactosidase and yields a blue 
precipitate on hydrolysis. Bacteria containing active p-galactosidase produce blue colonies when 

2 5 grown on media containing it. Cloning an insert into the pCR™ II plasmid disrupts the lacZa 

complementation fragment, preventing the production of active p-galactosidase. 

The ligation reaction mixture was added to competent cells. Controls consisted of adding 
pUC18, a test plasmid. The cells were first kept on ice and briefly heat shocked to enhance 

3 0 plasmid uptake into the cells. After a further incubation, the cells were plated onto the prepared 

agar plates. The competent cell controls were plated onto plates devoid of ampicillin. 
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The pUC18 control showed an excellent reaction with thousands of blue colonies and less 
than 1% white. The plates with transformants for the 4 Shigella species showed mostly white 
colonies with about 5% blue colonies. The white transformants were picked and plated again on 
5 the X-gal-LB-ampicillin plates to check further for up to 48 hours of incubation at 35 °C. The 
growth of all transformants on LB-kanamycin plates was also checked. All 40 transformants 
grew, showing the presence of plasmid PCR™ II, which confers resistance, in the transformants. 

Example 5 

1 0 Purifying the Cloned Plasmid DNA 

Insta-Mini-Prep™ Kits obtained from 5 Prime-3 Prime, Inc. (Boulder, Colorado) were 
used to purify the cloned DNA from each of the four Shigella species. Transformed cells were 
grown in LB-ampicillin overnight. Plasmid DNA was extracted from each cell pellet with 
phenol-chloroform-isoamyl alcohol. The Insta-Mini-Prep column was added to the samples and 
1 5 then centrifuged. These steps were repeated several times until the last centrifugation left a top, 
clear aqueous phase containing the plasmid whereas the gel trapped unwanted contaminants. 
The recovered top phase was stored at 4°C for analysis by PCR amplification and restriction 
digestion. 

20 Example 6 

Identifying the Cloned Segment 

Three different tests were performed to verify that the correct genomic material from 

Shigella had been cloned. The first test consisted of digesting the plasmid miniprep with the 

restriction enzyme EcoRl and comparing the results on gels. Digestion reactions for each 

2 5 Shigella species included samples of the miniprep DNA, bovine serum albumin, and EcoKL 

enzyme obtained from Promega (Madison, WI). Samples were incubated in a water bath for 1 h, 
put on ice, and then analyzed with 0.8% agarose gels. The results showed patterns that were 
consistent with the presence of a 1,500 kb insert, the size expected based on the PCR reactions, 
in the plasmid. The results suggest that the 1,500-kb insert contains an J^coRI-restriction site that 

30 is cleaved to an 800 bp fragment and a 700 bp fragment by complete digestion with the 
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restriction enzyme. The plasmid consists of 3.9 kb, which increases to 5.4 kb with the 1.5 kb 
insert. 

The second set of tests focused on the use of primers rpl and fD2, the same 16S 
5 ribosomal RNA primers as used previously to amplify the genomic DNA. The primers were 
used to specifically amplify the insert of 16S ribosomal RNA, if it were present in the plasmid. 
After amplification, the products were separated by gel electrophoresis to determine whether the 
band patterns were consistent with the size of the expected product. The gels showed that the 
primers had resulted in the amplification of a 1,500 bp fragment, as expected. 



The third set of experiments depended on identifying PCR products that resulted from 
PCR amplification of minipreps using universal vector primers Sp6 and T7 (Invitrogen, San 
Diego, California). The primers were added to a reaction mixture containing plasmid miniprep 
from each of the four Shigella species. The thermocycler program used was the same as in the 
1 5 previous PCR studies. The amplified products were then examined by using 0.8% agarose gels. 
Based on the known sequence of the plasmid, the products showed a 1,600-bp fragment, which 
includes the 1,500 bp-insert and part of the vector. 



The present example provides for large-scale preparation of plasmid DNA from 
transformed cells that contain the 16S ribosomal RNA gene fragments. Transformants were 
grown on LB-kanamycin plates at 35 °C overnight. A single colony from each growth was 

2 5 picked and then subcultured. The culture was then inoculated into a 1 -liter flask and grown 

overnight. Cells were pelleted by centrifugation before being broken open with fresh lysozyme, 
sodium hydroxide, and sodium dodecyl sulfate. Potassium acetate and glacial acetic acid were 
added to the reaction mixture. After clarifying the solution by low-speed centrifugation, 
isopropanol was added, the mixture was re-centrifuged, and the precipitated DNA pellet was 

3 0 dried in a vacuum heater. 



10 
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Example 7 

Large-Scale Preparation of Plasmid DNA Containing 
the 16S Ribosomal RNA Gene Fragment 
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The extracted circular or closed plasmid DNA was further purified by high-speed 
centrifugation in cesium chloride. The procedures included the addition of cesium chloride and 
ethidium bromide followed by low-speed centrifugation to remove suspended particles. The 
clarified solution was poured into plastic centrifuge tubes, which were then sealed and balanced. 
5 The tubes were centrifuged at 60,000 rpm overnight at 20°C. The circular plasmid band was 
then collected under long-wave ultraviolet light by puncturing the tube with an 18-gauge needle. 
The plasmid DNA was then treated with sodium chloride-saturated isopropanol. The upper pink 
layer was discarded, and the process repeated until both layers remained colorless. The lower 
aqueous phase containing DNA was precipitated by standard procedures, treated with RNase, 
1 0 washed, dried, and resuspended in buffer for sequencing. 

Example 8 

Sequences of 16s rDNA genes from Shigella Species 
Sequencing reactions were carried out using Perkin Elmer Dye Terminator Cycle 
15 Sequencing Ready Reaction Kit Pat. No. 402079 with fluorescent terminators (Perkin Elmer, 
Foster City, California). Single-stranded DNA template (0.5 \xg) or double-stranded DNA 
template (0.2-0.5 [ig), 0.8 pmol of primer for ss DNA or 3.2 pmol of primer for ds DNA, were 
combined in a final reaction volume of 20 \x\. Standard manual sequencing procedures as 
recommended by Perkin Elmer were followed. The product was resuspended in loading buffer, 
2 0 boiled and loaded. 

The 16s rDNA gene sequences are provided as SEQ ID NO:3 for Shigella flexneri, as 
SEQ ID NO:4 for Shigella sonnei, as SEQ ID NO:5 for Shigella dysenteriae, and as SEQ ID 
NO:6 for Shigella boydii. The first nucleotide of each sequence corresponds to nucleotide #8 of 
2 5 the E. coli sequence (SEQ ID NO:7). Therefore, to match a particular position in a Shigella 
sequence to the corresponding position in the reference E. coli sequence, the number seven is 
added to the position number of a Shigella nucleotide to match with the corresponding position 
of an "i?. coli equivalent nucleotide". 
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Example 9 

Genus-specific and Species-Specific Nucleotides 

The numbering of the nucleotides that are genus-specific or species-specific, as used 
herein and in Tables 2 and 3, result from aligning the obtained sequence with the sequence of the 
16s rDNA gene from E. coli which is provided as SEQ ID NO:7, and assigning the 
corresponding number of the E. coli sequence to those bases that are different or that are missing 
in the Shigella sequence. Base insertions in the Shigella sequence are referred to as an "p" base, 
i.e., an insertion following base 88, for example, is referred to herein as base 88p so as to keep 
the numbering consistent with the E. coli numbering sequence. Alignment was done manually; 
probe sequences were analyzed against all known 16s RNA sequences of the Aligned Sequence 
Program, Ribosome Database Project, University of Illinois, Dept. of Microbiology, Urbana- 
Champaign, Illinois, and against all sequences of Genbank, operated by the National Library of 
Medicine. 

Table 2 provides sequence comparisons of 16s rDNA from Shigella species and E. coli 



species. 



Table 2. Sequence comparisons pi 


F 16s rDNA from Shigella sp. and E. coli sp. 


ORGANISM 


REGI<bl 




SEQUENCE 


E. coli 
(ECORRD) 1 
SEQ ID NO: 8 


71-10 


0 


80 90 100 
AACAGGAAGA AGCTTGCTCT TT GCTGACGA 


E. coli (rrnG) 
(ECRRNGPK3) 


71-10 


0 




E. coli (rrrnE) 
(ECRRNEPK3) 


71-10 


) 




E. coli (rrnD) 
(ECRNNDPK3) 


71-10 


) 

h 


AC G /C\ — 


E. coli (rrnC) 
(ECRRNCPK3) 








E. coli (rrnB) 
(ECRRNBPK3) 


71-10 


0 




E. coli 
ATCC 25922 


71-10 


1 


CG /G\ 


E. coli (rrnB) 
(ECRRNBZ) 


71-10 
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E, coli 

AX L.U XX/ / O 1 




100 


C /G\ 


E. coli (rrnB) 

( T? r^* E> "D "NT "D 7 \ 


71- 


100 


/TN 


£. coli K-12 

\ xii^Ul or ] 


11- 


100 


m 


COJ.X (rrriD) 
(ECORGNB) 


/ X " 


i nn 
x u u 


/T\ 


£i . CO A i 

(ECOUW8 9) 


/ ± 


i nn 
x u u 




£. coli 
(ECOUW8 9) 


/ x- 


i n n 

xuu 


/T\ 




t 






S. flexneri 
ATCC 12022 


71- 


100 




S. flexneri 
French Strain 


71- 


100 


c G /CN 


S. flexneri 
ATCC 29903 
(present 
disclosure ) 
SEQ ID NO: 9 


71- 


100 












S. sonnei 
ATCC 92 90 


71- 


100 


AC G _ __ /cx 


S. sonnei 

French Strain ( 


/ 71 _ 


:.oo 

y\ 


____ AC G /c \ — — 


S. sonnei 
ATCC 2 9930 
(present 
disclosure ) 
SEQ ID NO: 10 


71- 


:.ooy 

i 

i 
i 


AC G /C\ 




I 
j 

i 




S. dysenteriae 
ATCC 13313 
(present 
disclosure) 
SEQ ID NO:ll 


71- 


{LOO 

1 

1 




S. dysenteriae 
French Strain 


71- 


I 100 
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S - civs (?*n t~ p v 7 (9 f 1 * 
ATCC 13313 


7 


1-100 










. 


£ h r> \/d 7 7* 
ATCC 8700 
(present 
disclosure) 
SEQ ID NO: 12 


7 


1-100 




S. boydii 
ATCC 9207 


t 


1-100 


c G- -/C\ 










E. coli 
(ECORRD) 
SEQ ID NO: 13 




61-991 


a '~i a A A A aaa 

970 980 990 
CGATGCAACG CGAAGAACCT TACCTGGTCT T 


E. coli (rrrnG) 
(ECRRNGPK3) 


9 


61-991 







E. coli (rrrnE) 
(ECRRNEPK3) 


9 


61-991 







E. coli (rrrnD) 
(ECRRNDPK3) 


9 


51-991 




. - 


£?• coli (rrnC) 
(ECRRNC) 








© x— ^ ^ X 





E. coli (rrnB) 
(ECRRNB) 


9 


61-991 







E* coli 
ATCC 25922 


9 


61-991 


; — 




E. coli (rrnB) 
(ECRRNBZ) i 


/ 


f>i->91 




„ . — - 


E. coli I 
ATCC 11775T \ 


9 








E. coli 
(ECRRNBZ) 


9 


61-991 

1 







E. coli 
(ECOTSF) 


9 


61-991 







E. coli (rrnB) 
(ECORGNB) 


9 


61-991 






E. coli 
(ECOUW89) 


9 


61-991 






E. coli 
(ECOUW89) 


9 


61-991 
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S. flexneri 

7\rnpr< 1 9 0 9 9 


961 


-991 






French Strain 




-QQ1 






S. flexneri 
ATCC 29903 
(present 

Hi crl piQnrp ^ 

SEQ ID NO: 13 


961- 


-991 













f 


5. sonnei 

nIL<^ tL. z> z) O \J 

(present 
disclosure) 
SEQ ID NO: 14 


961 


■-991 


c _ x 


S. sonnei 
French Strain 


961 


-991 







o * sonnej. 
ATCC 92 90 




zr z> J. 














S. dysenteriae 

(present 
disclosure) 
SEQ ID NO: 13 


961, 


-991 






S. dysenteriae 1 
French Strain I 


961 


7^ 


1 






S, dysenteriae 
ATCC 13313 


961 


-99 


1 

1 














S bo\/dii 
ATCC 9207 


961 


-991 






S. boydii 
ATCC 8700 
(present 
disclosure) 
SEQ ID NO: 13 


961 

i 


-991 








! 

! 
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E. coli 
(ECORRD) 
SEQ ID NO: 15 



1001.- 
1041 



1010 1020 1030 

GGAAGTTTTC AGAGAT GAGA ATG TGCCTTC 

1040 
GGGAACCGTG 



E. coli (rrnD) 
(ECORRNDPK3) 



ioo: 

1041 



E. coli (rrnC) 
(ECORRNCPK3) 



ioo: 

1041 



-NN 



E. coli (rrnB) 
(ECRRNBPK3) 



1001 
1040 



E. coli 
ATCC 25922 



1001 
1040 



E. coli 
(ECRRNBZ ) 



1001 
1040 



E. coli 
ATCC 11775T 



1001 
1040 



E. coli (rrnB) 
(ECRRNBZ) 



1001 
1040 



E. coli 
(ECOTSF) 



1001 
1040 



A C C- 



x — T — /G\-- 



E. coli (rrnB) 
(ECORGNB) 



1001 
1040 



E. coli 
(ECOUW89) 



1001 
1040 



E. coli 
(ECOUW8 9) 



1001 
1040 



S. flexneri 
ATCC 12022 



10-01 
1040 



S. flexneri 
French Strain 



1001 
1040 



S. flexneri 
ATCC 29903 
(present 
disclosure) 
SEQ ID NO: 15 



1001 
1040 
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5. sonnei 

(present 
disclosure) 
SEQ ID NO: 16 


1001- 

i A/in 
X u *± u 




A C C- X — T — /G\ 


S. sonnei 
French Strain 


1001 J 
1040 




c _ GA- -A-/G\ 


O • o LJIli 1 cr _L 

ATCC 92 90 


1040 
















S. dysenteriae 

/ll^^ X O X J5 

(present 
disclosure) 
SEQ ID NO: 17 


1001- 

± U ft u 




A C— C- X — /T\TG 

T ___ 


S. dysenteriae 
French Strain 


1001- 
1040 








S. dysenteriae 
ATCC 13313 


1001- 
1040 




A CC — GT AC — GG- — 










o . DOyCLll 
ATCC 9207 


1UU1™ 

1040 








o . JDoyci J. J. 
ATCC 8700 
(present 
disclosure) 
SEQ ID NO: 15 


i n m — 

1UU1 

1040 
















£. coli 
(ECORRD) 


f 


/ 


agctagtagg tggggtaacg gctcacctag 
Igcgacg 


E. coli (rrnG) 
(ECRRNGPK3) 




ro 




\ 


E. coli (rrnE) 
(ECRRNEPK3) 


245-2J 




- v 


E. coli (rrnD) 
(ECRRNDPK3) 


245-2J 


30 












S. boydii 
ATCC 8700 
(present 
disclosure) 


245-2? 


!o 
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1 Data for E. coli strain 7 are from the GenBank ECORRD Sequence Data Bank for 16s rDNA 
and are subject to Genbank descriptions. "X" as used herein means the absence of that 
nucleotide in comparison to the E. coli sequence. A dash "-" means the sequence is the same as 
that of E. coli. A symbol, "/ \" means an insertion of a nucleotide in a subsequent sequence as 

5 compared to that of E. coli ECORRD. 

2 The numbering of the region follows the numbering for the E. coli sequence ECORRD. 

1 0 From Table 2 and the present disclosure, one of skill in the art would be able to determine 

nucleotides or combinations of nucleotides at particular positions that would distinguish Shigella 
from E. coli and among Shigella species. For example, identifying nucleotides for Shigella 
sonnei include a C at position 964, or a deletion at position 978; identifying nucleotides for 
Shigella dysenteriae includes an A at position 76; identifying nucleotides for Shigella boydii 

1 5 include a C at position 92; and identifying nucleotides for E. coli include a T at position 88p. 

Table 3 provides a listing of exemplary genus-specific and species-specific probe 
molecules having select species-specific or genus-specific nucleotides distinguishing Shigella 
from E. coli 9 distinguishing E. coli from Shigella species, and identifying S. sonnei. 

20 

Table 3. Exemplary Genus-Specific and Species-Specific Probe Molecules 



Basis for Probe 
Selection 


Probe Sequence (20 mer) 




S. boydii 
(80-100) 


C AGCTTGCTCT TCGCTGACG 1 , 
SEQ ID NO: 18 


E. coli sequence shows a 
different set of nucleotides at 
underlined positions as seen 
in Table 2. 


S. dysenteriae 
(76-94) 


AAAGC AGCTTGCTCT TTGCT, 
SEQ ID NO: 19 


E. coli sequence shows a 
different nucleotide at 
underlined positions as seen 
in Table 2. 


S, sonnei 
(961-980) 


CGACGCAACG CGAAGAAXCT T 2 , 
SEQ ID NO:20 


Based on the data of Table 2, 
this sequence appears unique 
to S. sonnei. 
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E. coli 


GA AGCTTGCT/T\CT TTGCTGAC 3 , 


Presence of T at 88p 


(79-98) 


SEQ ID NO:21 


identifies E. coli. Shigella 






sequences differ. A few 






nucleotides toward the 3 1 






end, the insertion of a C in 






the Shigella sequence causes 






the sequence to realign 






correctly with the E. coli 






sequence. 



1 Bases underlined indicate a change in sequence relative to E. coli (ECORRD) 

2 X indicates the deletion of a base relative to the E. coli (ECORRD) sequence 

3 The symbol /T\ indicates a base insertion, in this case, insertion of T 

5 

One of skill in the art in light of this disclosure would be able to identify further 
combinations of nucleotides that would uniquely identify a particular strain of Shigella described 
herein by comparing nucleotides at particular positions. Such combinations are considered to be 
1 0 part of the present disclosure. 



Example 10 
Design of Probes for Identifying Shigella sp. 

15 Nucleic acid probes are designed for discriminatory hybridization to 16s rRNA or 16s 

rDNA from test samples for the identification of Shigella sp, based upon information provided 
herein and knowledge of the melting temperatures of probe sequences. The melting temperature 
of a DNA probe of less than about 40 bases can be calculated by the formula: 
(#purines x 4 degrees) H- (# pyrimidines x 2 degrees) = Tm. 

2 0 Thus, high GC containing probes have a higher melting temperature than probes having a high 
AT content. The general ability to discriminate between templates having a one base mismatch 
has been known in the field of nucleic acid hybridization for years. 



Specificity of hybridization is dependent upon probe length, about 15 bases being average 
2 5 for bacterial genomes. Shorter probe sequences may be used, as well as longer sequences; 
however, the effect of a one base mismatch is decreased with a longer probe. 



33 
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One of skill in this art would realize in light of the present disclosure that a probe nucleic 
acid molecule designed to detect a base match or mismatch at a certain nucleotide position would 
optimally be positioned near the middle of the molecule, i.e., there would be about as many 
nucleotides 5 f to the matched or mismatched site as are 3 1 to said site. Flexibility exists, 
5 however, especially since GC base pairs hydrogen bond more strongly than AT base pairs. An 
AT rich half of a candidate probe molecule in relation to said site may need to be made longer to 
account for the lower strength of binding. 

One of skill in this art in light of the present disclosure would also realize that a set of two 
10 or more probes (nucleic acid molecules) may be used to distinguish/identify a species or genus of 
bacteria. A first probe of a set may eliminate certain species/genera while a second probe of a set 
may eliminate further species/genera so as to provide identifications in combination with the 
results from using the first probe. 

1 5 Each nucleic acid probe is tested to demonstrate adequate exclusivity as to all other 

organisms, adequate inclusivity with respect to Shigella sp., accessibility of the target regions 
under various assay conditions that may be employed in test circumstances, and intramolecular 
interactions of the probe itself. 

2 0 All of the compositions and methods disclosed and claimed herein can be made and 

executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 
composition, methods and in the steps or in the sequence of steps of the method described herein 

2 5 without departing from the concept, spirit and scope of the invention. More specifically, it will 
be apparent that certain agents which are both chemically and physiologically related may be 
substituted for the agents described herein while the same or similar results would be achieved. 
All such similar substitutes and modifications apparent to those skilled in the art are deemed to 
be within the spirit, scope and concept of the invention as defined by the appended claims. 

30 
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References cited herein, to the extent that they provide exemplary procedural or other 
details supplementary to those set forth herein, are specifically incorporated herein by reference. 
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